This paper documents a model of the COVID-19 epidemic in South Africa. Mobility data is used to model the reproduction number of the COVID-19 epidemic over time using a Bayesian hierarchical model. Results are calibrated to reported deaths only. This is achieved by adapting the work in by Imperial College London researchers [1] for South Africa. Authors from Imperial College London have built similar models for Brazil [2] and the United States [3].
The model is based on data as compiled in [4]. The model uses mobility movement indexes by province produced by Google [5]. Furthermore the model includes a fixed effect for interventions introduced at the start of level 4 lockdown.
The model and report are automatically generated on a regular basis using R [6]. This version contains data available on 24 July 2020.
An online, regularly updated version of this report is available here.
As this paper is updated over time this section will summarise significant changes. The code producing this paper is tracked using git. The git commit hash for this project at the time of generating this paper was 42f76d422ac9d8f935879b7296881230b5490303.
2020-05-31
2020-06-01
2020-06-04
2020-06-08
2020-06-09
2020-06-10
2020-06-20
2020-06-22
2020-07-04
2020-07-07
2020-07-16
A detailed description of the methodology and assumptions is provided below below. The key features of the approach employed are summarised here:
The national mobility data from [5] is plotted below. The model uses the indexes at a provincial level but here the national indexes are plotted for convenience. Clear trends are observable:
The chart below summarises the average mobility and residential indexes used in the model. This is the average of mobility indicators excluding parks and residential. This follows [7].
There are some risks of the mobility data above as it may be biased to the Android operating system, people with smartphones and people with enough data on their smartphone to share their location.
This bias does not really represent a problem though unless it changes over time because then the model would not be able to produce accurate projections. There is a risk that mobility changes disproportionally between users contributing data to these indexes and those not. It would not seem reasonable to expect a major change in the bias over time (unless the calculation is somehow compromised). Provincial biases may exist, but the model also allows the provinces to exhibit provincial specific mobility effects for both parameters.
The average mobility (excluding residential & park indexes) is plotted below for the various provinces. From this plot it seems apparent that Western Cape mobility has reduced the most during lockdown and is, perhaps, increasing slower.
The sections below show how well the model is reproducing past death data. As the model is fitted to past death data we would want to see how well it’s reproducing such data. Each section covers a province.
Three panels are plotted for each province:
In all the charts the darker shaded area represents a confidence interval of 50% and the lighter shaded area represents a confidence interval of 95%.
In general, it is noted:
The Western Cape has the most reported deaths of all provinces and hence the most data to calibrate. The modelled infections are plotted below. It’s clear that modelled infections are far outpacing reported cases (brown).
Over the last 14 days it would appear that the Western Cape only tested 4.4% of all new infections. This figure seems quite low, but it seems that the growth in infection in the last number of weeks coincided with a large backlog of tests as well as a change in Western Cape testing policy. The Western Cape is limiting tests due to shortages to only higher risk groups [11].
The reported deaths (in brown) and modelled reported deaths (in blue) are plotted below. The Western Cape has rapidly increasing numbers of deaths. The model appears to be reasonable given the data. The data does seem quite variable from day to day which may be perhaps due to data processing delays causing clumping of reported deaths.
Below we plot the residuals by day. The projections for the Western Cape model are exceeding reported deaths in June. This has improved since the introduction of the mask wearing variable, but still is not great.
The \(R_{t,m}\) has actually reduced at the start of level 4 lockdown. Western Cape \(R_{t,m}\) has also been reducing over time to present levels.
Below modelled infections are plotted compared to confirmed cases. Over the last 14 days it would appear that 2.9% of all new infections were tested.
The reported deaths (in brown) and modelled reported deaths (in blue) are plotted below. Model for Eastern Cape also appears slightly on the high side but less so that the Western Cape.
Below we plot the residuals by day.
Below the \(R_{t,m}\) is plotted:
Recent spikes of deaths seem to be resulting in an increasing \(R_{t,m}\). Over the last 14 days it would appear that 2% of all new infections were tested.
The reported deaths (in brown) and modelled reported deaths (in blue) are plotted below.
Below we plot the residuals by day.
Below the \(R_{t,m}\) is plotted:
Recent spikes of deaths seem to be resulting in an increasing \(R_{t,m}\). Over the last 14 days it would appear that 1.9% of all new infections were tested.
The reported deaths (in brown) and modelled reported deaths (in blue) are plotted below.
Below we plot the residuals by day.
Below the \(R_{t,m}\) is plotted:
The other provinces have limited data.
Over the last 14 days it would appear that 2.3% of all new infections were tested.
The reported deaths (in brown) and modelled reported deaths (in blue) are plotted below.
Below we plot the residuals by day.
Below the \(R_{t,m}\) is plotted:
To understand the net parameter estimates for \(\alpha\) and \(\alpha_{m}^s\) and their impact on \(R_{t,m}\) we plot the percentage reduction in \(R_{t,m}\) assuming the particular index is 1 (representing a 100% reduction in average mobility).
This is equivalent to plotting \(1-2\cdot\phi^{-1}(-(\alpha+\alpha_{m}^s))\) for a particular province \(m\). We also plot \(1-2\cdot\phi^{-1}(-(\beta+\beta_{m}^s))\).
Confidence intervals are wide, though the average mobility index shows a big impact on \(R_{t,m}\) (assuming 100% reduction in mobility). Interventions introduced with level 4 lockdown also seem to be reducing \(R_{t,m}\) on average between 10% and 30%.
Estimates for \(R_{0,m}\) for each province are plotted below. It is clear that the posterior estimates for \(R_{0,m}\) is not heavily influenced by the data. This is probably due to the relatively early lockdown implemented in South Africa. There were probably not enough deaths that resulted from infection prior to lockdown to develop an independent estimate for each province of \(R_{0,m}\).
Current estimates and 95% confidence intervals for \(R_{t,m}\) (current reproduction number) are plotted below for each province. It’s clear that currently the values of \(R_{t,m}\) for some provinces now include 1 in the CIs. A value below 1 would indicate an epidemic that is slowing while a value above 1 indicates an epidemic that is growing. It is clear that the spread of the epidemic is somewhat slowed compared to the initial \(R_{0,m}\).
The wide confidence intervals would indicate that we need to wait for the epidemic to further develop to include more data in our models. The confidence interval for Western Cape is narrowing already.
The estimated attack rate (with 95% confidence intervals) is tabulated below. This is the proportion of the population infected to date. This figure has to be estimated, because many that are infected experience no or mild symptoms, thus they may not seek medical advice and hence will never be tested.
Western Cape has the highest estimated prevalence to date, but with fairly wide confidence intervals. Eastern Cape has the second highest prevalence.
The Western Cape figures may seem high but a recent presentation indicate positive rates from 20% to 40% during the final weeks of May 2020 depending on sub-district and whether it’s private or public health facilities doing the tests [12]. The public sector testing does seem closer to a 30% proportion positive at present from the graphs in that presentation. This information is not equivalent to an attack rate, but may be indicative of high numbers of infections being modelled here, especially those that have occurred relatively recently.
| Province | Attack Rate |
|---|---|
| EC | 35.78% [19.80%-56.02%] |
| GP | 43.44% [23.27%-67.43%] |
| KZN | 28.96% [11.82%-55.81%] |
| WC | 23.03% [14.88%-34.81%] |
| OTH | 10.57% [3.64%-24.92%] |
| South Africa | 26.92% [19.01%-36.48%] |
One of the reasons we build models is so that we can make sense of the future or indeed the past. We can project forward models to assess the impact of varying assumptions on future outcomes. This gives us a sense of how changes in actions may impact the future. I.e. it allows us to answer “what if” questions. Note however that in projecting the future we are extrapolating, and due care needs to be taken. There are numerous limitations to this model and these projections dicussed below but the author is also of the opinion that the projections add value in that they indicate a significant range across the scenarios projected and in such a manner inform discussion.
All models are wrong but some are useful - George Box [13]
An incorrect model can be useful because it enables a better understanding of the model and the phenomena being modelled. This section should be used with caution because, as we show in calibration and backtesting, the model seems to be projecting higher deaths than observed for the Western Cape and Eastern Cape.
Note detailed projection output can be found here.
The first projections holds mobility constant at current levels which would be associated with level 3 of lockdown. Level 4 interventions are left intact.
The result of this scenario as at 31 December 2020 is tabulated below.
| Province | Attack Rate | Reported Deaths | Peak Daily Reported Deaths | Peak Reported Date | Deaths | Peak Daily Deaths | Peak Date |
|---|---|---|---|---|---|---|---|
| EC | 59.20% [50.31%-67.89%] | 5 749 [ 3 012- 9 859] | 83 | 2020-08-11 | 25 815 [ 20 087- 31 989] | 374 | 2020-08-09 |
| GP | 71.84% [64.70%-78.82%] | 9 595 [ 4 130-17 576] | 184 | 2020-08-13 | 48 242 [ 38 608- 58 497] | 932 | 2020-08-11 |
| KZN | 70.86% [61.75%-78.60%] | 7 333 [ 2 836-13 730] | 133 | 2020-08-23 | 37 987 [ 29 863- 46 536] | 691 | 2020-08-20 |
| WC | 37.83% [25.26%-49.88%] | 7 524 [ 4 956-10 862] | 65 | 2020-08-06 | 12 570 [ 8 357- 17 254] | 110 | 2020-08-05 |
| OTH | 68.97% [59.94%-77.27%] | 12 429 [ 4 603-23 341] | 216 | 2020-09-08 | 64 494 [ 50 102- 79 889] | 1 120 | 2020-09-06 |
| South Africa | 65.33% [60.99%-69.57%] | 42 631 [30 002-57 732] | 592 | 2020-08-23 | 189 107 [168 295-211 003] | 2 792 | 2020-08-22 |
This scenario assumes future mobility half-way between current mobility levels (associated with level 3 of lockdown) and normal levels. Level 4 interventions are left intact. The attack rate and deaths after increasing mobility are tabulated below (as at 31 December 2020).
| Province | Attack Rate | Reported Deaths | Peak Daily Reported Deaths | Peak Reported Date | Deaths | Peak Daily Deaths | Peak Date |
|---|---|---|---|---|---|---|---|
| EC | 67.40% [54.82%-75.63%] | 6 603 [ 3 231-11 757] | 93 | 2020-08-20 | 29 409 [ 22 238- 36 436] | 406 | 2020-08-18 |
| GP | 81.14% [73.37%-86.62%] | 10 928 [ 4 423-20 542] | 215 | 2020-08-18 | 54 515 [ 43 163- 66 286] | 1 049 | 2020-08-16 |
| KZN | 82.93% [71.46%-89.13%] | 8 616 [ 3 183-16 160] | 184 | 2020-08-25 | 44 470 [ 34 915- 54 211] | 935 | 2020-08-24 |
| WC | 47.27% [24.12%-63.81%] | 9 443 [ 4 924-14 789] | 77 | 2020-09-02 | 15 731 [ 8 128- 22 928] | 128 | 2020-09-01 |
| OTH | 80.22% [69.01%-87.75%] | 14 488 [ 5 342-27 395] | 313 | 2020-09-05 | 75 031 [ 57 803- 92 295] | 1 607 | 2020-09-04 |
| South Africa | 75.68% [69.07%-80.23%] | 50 079 [34 413-68 489] | 811 | 2020-08-28 | 219 157 [192 455-246 048] | 3 767 | 2020-08-27 |
Based on the above an increase mobility could mean roughly 30 000 more deaths by the end of the year.
This scenario assumes future mobility at average levels seen during level 4 lockdown. This may occur either through actual reinstatement of level 4, or perhaps can be considered as a possibility if increased mobility does not result in as much increase in the reproductive number.
The attack rate and deaths after decreased mobility are tabulated below (as at 31 December 2020).
| Province | Attack Rate | Reported Deaths | Peak Daily Reported Deaths | Peak Reported Date | Deaths | Peak Daily Deaths | Peak Date |
|---|---|---|---|---|---|---|---|
| EC | 54.01% [43.44%-64.99%] | 5 196 [ 2 875- 8 515] | 80 | 2020-08-08 | 23 537 [ 17 804- 29 965] | 368 | 2020-08-07 |
| GP | 65.81% [56.07%-76.07%] | 8 703 [ 3 963-15 399] | 176 | 2020-08-10 | 44 165 [ 34 586- 54 676] | 901 | 2020-08-09 |
| KZN | 56.14% [43.79%-69.90%] | 5 709 [ 2 484-10 405] | 104 | 2020-08-15 | 30 064 [ 21 925- 39 186] | 566 | 2020-08-13 |
| WC | 33.66% [23.21%-45.06%] | 6 681 [ 4 705- 9 285] | 65 | 2020-08-03 | 11 184 [ 7 795- 15 197] | 109 | 2020-08-03 |
| OTH | 50.82% [36.65%-63.82%] | 9 015 [ 3 557-16 865] | 122 | 2020-09-04 | 47 365 [ 32 254- 63 496] | 659 | 2020-09-01 |
| South Africa | 54.08% [47.82%-60.00%] | 35 305 [25 684-47 048] | 504 | 2020-08-14 | 156 316 [134 236-178 882] | 2 404 | 2020-08-13 |
Based on the above a decrease in mobility could mean roughly 33 000 fewer deaths by the end of the year.
Below we plot projections for the Western Cape.
The attack rate and deaths over a longer period for both the constant mobility and increased mobility scenarios are plotted below:
Below we plot the projections for Eastern Cape.
The attack rate and deaths over a longer period for both the constant mobility and increased mobility scenarios are plotted below:
Below we plot the projections for Gauteng.
The attack rate and deaths over a longer period for both the constant mobility and increased mobility scenarios are plotted below:
Below we plot the projections for KwaZulu-Natal.
The attack rate and deaths over a longer period for both the constant mobility and increased mobility scenarios are plotted below:
Below we plot the projections for the provinces other than, Western Cape, Eastern Cape, Gauteng and KwaZuly-Natal.
The attack rate and deaths over a longer period for both the constant mobility and increased mobility scenarios are plotted below:
We plot below the results for South Africa as a whole. This is the sum of the provincial projections.
The next 30 days the attack rate and deaths are increasing rapidly. The model predicts that within 30 days it is possible that deaths will exceed 500 per day and that more than 10% of the South African population may be infected.
The attack rate and deaths over a longer period for both the constant mobility and increased mobility scenarios are plotted below:
In the sections below the model is backtested. The backtesting assumes perfect knowledge of the mobility indexes to date, but tests the models with the 14 or 28 most recent days of death data excluded. The test is to see how well the model predicts forward over time.
Below we plot the backtesting for 14-days of recent data excluded as indicated by the black dotted line. The model is predicting more deaths for the Western Cape and Eastern Cape.
We provide residual plots for the above below, but it’s only really useful for the Western Cape as other provinces have many days of no deaths. For the Western Cape it seems that the model is projecting higher than the reported deaths.
Below we plot the backtesting for 28-days of recent data excluded as indicated by the black dotted line. Again the model is improved from previous version but still over-predicting for Western Cape and Eastern Cape. Observations do fall in the wide confidence intervals.
Below we plot the residuals for the above.
This analysis has various limitations:
From the results it’s clear that \(R_{t,m}\) in all provinces has reduced from the starting values of \(R_{0,m}\) and this has slowed the spread of the epidemic in South Africa saving lives. However, in Europe the lockdowns have been able to reduce \(R_{t}\) below 1 for various countries [17]. As shown here, South Africa’s lockdown and other interventions have not been as successful as in most the European countries shown in [17].
Mobility has increased somewhat following the commencement of level 4 lockdowns. This results in an increasing the \(R_{t,m}\) in various provinces since the start of May resulting in a corresponding increase in deaths two to four weeks later.
Mobility has increased further during level 3 lockdown. This results in an increasing the \(R_{t,m}\) in various provinces since the start of May. The death data corresponding to this increased \(R_{t,m}\) is not significant as yet and we will soon see whether this model is handling this accurately, or if potentially through other interventions the Level 3 is not increasing \(R_{t,m}\). Other mitigations may dampen the effect of increasing mobility.
Projections on current mobility levels would already result in significant peaks in deaths. Further relaxation of lockdowns will result in increases in mobility, which would worsen the reproduction numbers.
Based on the modelling it is expected to result in roughly 30 000 more deaths by the end of the year. This ignores other impacts such as ICU availability and the impact on deaths from other causes which would be expected to increase these figures, but also the impact the homogeneity assumption which might be exacerbating this number.
Reducing mobility could result in roughly -33 000 fewer deaths by the end of the year. This could happen via reverting lockdown to level 4 or, possibly, if other non-mobility interventions act as a dampener on any increases in the reproduction number. Again, this ignores other impacts such as ICU availability and the impact on deaths from other causes.
The large differences between the various provinces was surprising. Given the high prevalence in the Western Cape it may be prudent to enforce some screening or travel restrictions out of the province.
The IFR is not treated as a parameter but as a constant with random noise. Changes to the IFR will change the modelled infections that correlate with the observed deaths. Sensitivity to the IFR could be modelled.
Using mobility data is useful to not only measure the impact of government interventions but also include the societal response to those interventions (as they affect mobility). This means that changes in the reaction to new regulations can be modelled. It may also be useful going forward as many new regulations are introduced possibly at a provincial level to summarise the impact of interventions numerically.
This paper projects mobility changes forward. We can see that as the model assigns some value to initiatives introduced at the start of level 4 around mask wearing and workplace screening. These are reducing the impact of increasing mobility such that in the Western Cape the \(R_{t,m}\) actually reduced in level 4 compared to level 5.
It is difficult to know how this will continue forward especially given that the projections for the Western Cape is still exceeding the observations both in calibration and in backtesting.
When health system capacity is under ressure in various provinces it would seem likely that the mortality rate of infected individuals in those provinces would increase. This model needs to take account of that for two reasons:
The model does not currently do this.
Backtesting shows reasonable performance of the model at least over a shorter 28-day and 14-day time periods. There is some bias in which appears to be over-predicting reported deaths in the last week or two.
Some possible reasons are:
These will be investigated to the extent possible.
The model is still projecting too high deaths for Western Cape and Eastern Cape in calibration and/or backtesting.
The author intends to investigate the introduction of autoregressive process as per [3] to model further residuals that cannot seem to be captures by existing parameters. These may be reflecting other NPIs, general population awareness and behaviour adjustment or other unmodelled disease effects.
Other areas to investigate in this regard:
Along with the above, the author intends to investigate the following:
The model assumes that current reproduction number, \(R_{t,m}\), is a function of the initial reproduction number, \(R_{0,m}\), and mobility changes over time. It then calculates infections as a function of \(R_{t,m}\) over time, and then, using these infections calculates deaths from the infections based on a distribution of time to death. Various prior distributions are assumed. The model structure is identical to that in [1] but is briefly documented below. The parameters are estimated jointly using a single hierarchical model covering all provinces. This means that data in different provinces are combined to inform all parameters in the model. As per [1], fitting was done in R using Stan with an adaptive Hamiltonian Monte Carlo sampler.
The model is documented here but is as per in [1], apart from the simplification of mobility indexes to a single index. The model assumes a base reproduction number (\(R_{0,m}\)) for each province (\(m\)) and then it models future values of the reproduction number using mobility indexes as follows:
\(R_{t,m}=R_{0,m}\cdot2\cdot\phi^{-1}(-(\alpha+\alpha_{m}^s)I_{t,m}^{\alpha}-(\beta+\beta_{m}^s)I_{t,m}^{\beta})\)
Here:
Infections are modelled as:
\(c_{t,m}=S_{t,m}R_{t,m}\sum_{\tau=0}^{t-1}c_{\tau,m}g_{t-\tau}\) where \(S_{t,m}=1 - \frac{\sum_{i=0}^{t-1}c_{i,m}}{N_{m}}\).
Infections, \(c_{t,m}\) at time \(t\) are a function of proportion of population not yet infected (\(S_{t,m}\)), the reproduction number (\(R_{t,m}\)) and infections prior to that \(c_{\tau,m}\) as well as an infectiousness curve \(g_{t-\tau}\). \(N_m\) is the population in province \(m\).
Deaths, \(d_{t,m}\) are modelled as:
\(d_{t,m}=ifr_{m}^*\sum_{\tau=0}^{t-1}c_{\tau,m}\pi_{t-\tau}\)
Here:
We model completeness of reporting as random noise from a Beta distribution:
\(\psi_m \sim B(\mu_mv_m,(1-\mu_m)v_m)\)
Here:
Reported deaths, \(d'_{t,m}\) are then:
\(d'_{t,m}=d_{t,m}\psi_m\)
Reported daily deaths \(D'_{t,m}\) are assumed to have the following distribution:
\(D'_{t,m} \sim Negative\ Binomial(d'_{t,m},d'_{t,m}+ \frac{{d'}_{t,m}^2}{\phi})\)
If \(Y \sim N(\mu,\sigma)\) then we define \(N^{+}\) to mean the distribution of \(|Y| \sim N^{+}(\mu,\sigma)\).
We assume that:
\(\phi \sim N^{+}(0,5)\)
Then: \(d'_{t,m}=E(D'_{t,m})\)
The following assumptions are taken as is from [1] except for where indicated.
We add random noise to the IFR as follows:
\(ifr_{m}^*=ifr_{m}\cdot N(1,0.1)\)
\(\alpha\) & $$ is normally distributed with a 0 mean:
\(\alpha \sim N(0,0.5)\)
\(\beta \sim N^{+}(0,1)\)
The above distribution for \(\beta\) has been increased to reflect [8] which indicated an effect of mask wearing. In this paper a relative risk of 0.56 is indicated for mask wearing in non-health-care settings. The above prior would indicate a 0.62 expected relative risk. \(\beta\) is not a parameter in the model used in [1].
Then for the provincial specific index effects we use:
\(\alpha_{m}^s \sim N(0,\gamma^{\alpha})\) with \(\gamma^{\alpha} \sim N^{+}(0,0.5)\)
\(\beta_{m}^s \sim N(0,\gamma^{\beta})\) with \(\gamma^{\beta} \sim N^{+}(0,0.5)\)
\(\beta_{m}^s\) is a parameter in the model used in [1].
The \(R_{0,m}\) are defined to be distributed normally as follows:
\(R_{0,m} \sim N(3.28,\kappa)\) with \(\kappa \sim N^{+}(0,0.5)\)
Infectiousness follows this distribution:
\(g \sim Gamma(6.5,0.62)\)
Time to death follows this distribution:
\(\pi \sim Gamma(5.1,0.86)+Gamma(17.8,0.45)\)
The distribution of time to death is plotted below.
The above implies an average time to death of 23 days.
Death and case data were used from [4]. This data set contains, amongst other, provincial case and deaths data digitised mainly from daily tweets by National Institute of Communicable Diseases (NICD).
We note again that the model calibrates to only the deaths. The reason for not calibrating to confirmed cases is that the bias in the testing is unknown. The degree to which testing has focussed on symptomatic and people seeking medical treatment or hospital treatment is unknown and could have changed over time. This would present a biased estimate and would require adjustment in this model.
Based on limited death data, provinces were aggregated as follows:
The IFR (\(ifr_{m}\)) for each province was calculated using the output of the squire R package [18]. It produces an age-specific infection attack rates (IAR), infections and deaths. The per age band IFR were used together with the per age band IAR and these were applied to provincial populations [19]. The IFRs from squire package are based on [20], [21] and [18].
The projection was done doing the default parameters for South Africa and the resultant attack rate (\(a_{x}\)) and IFR (\(ifr_{x}\)) for each 5-year age band was obtained (\(x\)).
Additionally HIV prevalence by age-band and province, \(i_{x,m}^{HIV}\) was taken into account from [22] as well as results from [12]. In [12] it is shown that lives with HIV have higher COVID-19 mortality once infected. Based on the these results we assume three times COVID-19 mortality below 55 and double mortality from age 55 upwards (\(m_x^{HIV}\)).
These \(a_{x}\) and \(ifr_{x}\) and HIV adjustments were then applied to the South African population per province and age band as per [19] and weighted IFRs calculated as follows.
\(ifr_{m}=\frac{\sum_{x}N_{x,m} \cdot a_{x} \cdot ifr_{x}( (1-i_{x,m}^{HIV}) +i_{x,m}^{HIV} \cdot m_x^{HIV})}{\sum_{x}N_{x,m}}\)
Here \(N_{x,m}\) is the population in a particular province (\(m\)) and age band (\(x\)).
Below we tabulate the resultant \(ifr_{m}\):
| Province | IFR |
|---|---|
| EC | 0.64% |
| GP | 0.44% |
| KZN | 0.47% |
| WC | 0.5% |
| OTH | 0.5% |
| South Africa | 0.49% |
The differences between provinces reflect the different age profiles in those provinces as per [19]. This seems low compared to the IFRs in [15], but may be reasonable given the younger profile of the South African population.
Based excess deaths estimated in [23], [16], [9] and [24] and reported deaths in [4] one can estimate the completeness of reporting. Below excess deaths as derived in [9] over 6 May 2020 to 14 July 2020 is compared with reported deaths for the same period (per [4]). The assumptions is that 90% of excess deaths are COVID-19 related.
For Northern Cape we have assumed the same ratio as per Eastern cape, Gauteng and KwaZulu-Natal combined.
| Province | Excess Deaths | Reported Deaths | COVID-19 Excess | Completeness |
|---|---|---|---|---|
| EC | 3299 | 546 | 2969.1 | 0.18 |
| GP | 2779 | 463 | 2501.1 | 0.19 |
| KZN | 1079 | 169 | 971.1 | 0.17 |
| WC | 3694 | 2074 | 3324.6 | 0.62 |
| OTH | NA | NA | NA | 0.18 |
Google has released aggregated mobility data for various countries and for sub-regions in those countries. These data is generated from devices that have enabled the Location History in Google Maps. This feature is available both on Android and iOS devices but is off by default.
For South Africa, these data contain the mobility indexes for each province. These are described in [5] as follows:
These measure relative changes in mobility in above dimensions relative to a baseline established before the epidemic. For example, -30% implies a 30% reduction in mobility from pre-COVID-19 mobility.
As per [7] these data were combined into an average mobility index for each province which was an average of all mobility indexes excluding:
A residential index is also included.
In [1] three indexes were used (Residential, Transit and the rest). This was reduced for this paper due to limited data.
We calculated indexes for OTH by weighting the individual provinces by population.
[1] M. Vollmer et al., “Report 20: A sub-national analysis of the rate of transmission of COVID-19 in Italy,” Imperial College London, 2020 [Online]. Available: https://www.imperial.ac.uk/mrc-global-infectious-disease-analysis/covid-19/report-20-italy/
[2] T. Mellan et al., “Report 21: Estimating COVID-19 cases and reproduction number in Brazil,” Imperial College London, 2020 [Online]. Available: https://www.imperial.ac.uk/mrc-global-infectious-disease-analysis/covid-19/report-21-brazil/
[3] H. Unwin et al., “Report 23: State-level tracking of COVID-19 in the United States,” Imperial College London, 2020 [Online]. Available: https://www.imperial.ac.uk/mrc-global-infectious-disease-analysis/covid-19/report-23-united-states/
[4] V. Marivate et al., “Coronavirus disease (COVID-19) case data - South Africa.” Zenodo, 2020 [Online]. Available: https://zenodo.org/record/3888499
[5] Google LLC, “Google COVID-19 community mobility reports.” 2020 [Online]. Available: https://www.google.com/covid19/mobility/
[6] R Core Team, R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing, 2019 [Online]. Available: https://www.R-project.org/
[7] P. Nouvellet et al., “Report 26: Reduction in mobility and COVID-19 transmission,” Imperial College London, 2020 [Online]. Available: https://www.imperial.ac.uk/mrc-global-infectious-disease-analysis/covid-19/report-26-mobility-transmission/
[8] D. K. Chu et al., “Physical distancing, face masks, and eye protection to prevent person-to-person transmission of SARS-CoV-2 and COVID-19: A systematic review and meta-analysis,” The Lancet, vol. 395, no. 10242, pp. 1973–1987, Jun. 2020, doi: 10.1016/S0140-6736(20)31142-9. [Online]. Available: https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(20)31142-9/fulltext
[9] D. Bradshaw, R. Laubscher, R. Dorrington, P. Groenewald, and T. Moultrie, “Report on weekly deaths in South Africa 1 January - 7 July 2020 (Week 27),” Burden of Disease Research Unit, South African Medical Research Council [Online]. Available: https://www.samrc.ac.za/sites/default/files/files/2020-07-15/WeeklyDeaths7July2020_0.pdf
[10] J. D. Goodman and M. Rothfield, “1 in 5 New Yorkers may have had Covid-19, antibody tests suggest,” New York Times [Online]. Available: https://www.nytimes.com/2020/04/23/nyregion/coronavirus-antibodies-test-ny.html
[11] A. Winde, “Update on the coronavirus by Premier Alan Winde.” Western Cape Government [Online]. Available: https://coronavirus.westerncape.gov.za/news/update-coronavirus-premier-alan-winde-6-june
[12] M.-A. Davies, “Brief update on surveilance of COVID-19 infections in the Western Cape.” 03-Jun-2020. [Online]. Available: http://www.medicine.uct.ac.za/covid19-echo-clinic
[13] G. E. P. Box, “Robustness in the strategy of scientific model building,” in Robustness in statistics, Elsevier, 1979, pp. 201–236 [Online]. Available: https://doi.org/10.1016/b978-0-12-438150-6.50018-2
[14] Independent Communications Authority of South Africa, “The state of the ICT sector report in South Africa,” 2020 [Online]. Available: https://www.icasa.org.za/legislation-and-regulations/state-of-the-ict-sector-in-south-africa-2020-report
[15] G. Meyerowitz-Katz and L. Merone, “A systematic review and meta-analysis of published research data on COVID-19 infection-fatality rates” [Online]. Available: https://www.medrxiv.org/content/10.1101/2020.05.03.20089854v3
[16] D. Bradshaw, R. Laubscher, R. Dorrington, P. Groenewald, and T. Moultrie, “Report on weekly deaths in South Africa 1 January - 30 June 2020 (Week 26),” Burden of Disease Research Unit, South African Medical Research Council [Online]. Available: https://www.samrc.ac.za/sites/default/files/files/2020-07-08/WeeklyDeaths30June2020.pdf
[17] S. Flaxman et al., “Report 13: Estimating the number of infections and the impact of non-pharmaceutical interventions on COVID-19 in 11 European countries,” Imperial College London, 2020 [Online]. Available: https://www.imperial.ac.uk/mrc-global-infectious-disease-analysis/covid-19/report-13-europe-npi-impact/
[18] P. Walker et al., “Report 12: The global impact of COVID-19 and strategies for mitigation and suppression,” Imperial College London, 2020 [Online]. Available: https://www.imperial.ac.uk/mrc-global-infectious-disease-analysis/covid-19/report-12-global-impact-covid-19/
[19] Statistics South Africa, “Mid-year population estimates 2019,” Republic of South Africa, 2019 [Online]. Available: https://www.statssa.gov.za/publications/P0302/P03022019.pdf
[20] R. Verity et al., “Estimates of the severity of coronavirus disease 2019: A model-based analysis,” The Lancet Infectious Diseases, vol. 20, no. 6, pp. 669–677, Jun. 2020, doi: 10.1016/s1473-3099(20)30243-7. [Online]. Available: https://www.thelancet.com/journals/laninf/article/PIIS1473-3099(20)30243-7/fulltext
[21] N. Ferguson et al., “Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand,” Imperial College London, 2020 [Online]. Available: https://www.imperial.ac.uk/mrc-global-infectious-disease-analysis/covid-19/report-9-impact-of-npis-on-covid-19/
[22] L. Johnson, R. Dorrington, and H. Moolla, “Progress towards the 2020 targets for hiv diagnosis and antiretroviral treatment in south africa,” Southern African Journal of HIV Medicine, vol. 18, no. 1, p. 8, 2017, doi: 10.4102/sajhivmed.v18i1.694. [Online]. Available: https://sajhivmed.org.za/index.php/hivmed/article/view/694
[23] D. Bradshaw, R. Laubscher, R. Dorrington, P. Groenewald, and T. Moultrie, “Report on weekly deaths in South Africa 1 January - 23 June 2020 (Week 25),” Burden of Disease Research Unit, South African Medical Research Council [Online]. Available: https://www.samrc.ac.za/sites/default/files/files/2020-07-01/WeeklyDeaths23June2020_0.pdf
[24] D. Bradshaw, R. Laubscher, R. Dorrington, P. Groenewald, and T. Moultrie, “Report on weekly deaths in South Africa 1 January - 14 July 2020 (Week 28),” Burden of Disease Research Unit, South African Medical Research Council [Online]. Available: https://www.samrc.ac.za/sites/default/files/files/2020-07-01/WeeklyDeaths23June2020_0.pdf